An extended relational document retrieval model
نویسنده
چکیده
Relational Data Base Management Systems offer a commercially available tool with which to build effective document retrieval systems. The full potential of the relational model for supporting the kind of ad hoc inquiry characteristic of document retrieval has only recently been explored. In addition, commercially available relational DBMS’s also provide effective tools for managing document data bases by providing facilities for, inter alia, concurrency control, data migration and reorganization routines, authorization mechanisms, enforcement of integrity constraints, dynamic data definition, etc. This article will present a relational logical model to support a sophisticated document retrieval system in which flexible forms of inferential and associative searching can be performed. Examples of ad hoc inquiry will be presented in SQL. Several problems of particular importance to document retrieval will be discussed, including the importance of Conjunctive Normal Form in query formulation, unique aspects of document retrieval storage and processing overhead, and techniques for reducing the size of storage without severely impacting retrieval effectiveness.
منابع مشابه
Integrating INQUERY with an RDBMS to Support Text Retrieval
Information is a combination of structured data and unstructured data. Traditionally, relational database management systems (RDBMS) have been designed to handle structured data. IR systems can handle text (unstructured data) very well but are not designed to handle structured data. With present day information being a combination of structured and unstructured data, there is an increasing dema...
متن کاملUsing the Relational Model and Part-of-Speech Tagging to Implement Text Relevance
We introduce a database design that improves prior work on document retrieval within the relational model. While previous approaches require extensions to the relational model, our approach uses an unchanged relational system. We focus on the implementation of assigning a measure of relevance between a query and a document as this is more useful for large document databases. Since run-time perf...
متن کاملEvaluation of object-relational database systems for fulltext retrieval
Object-relational database systems add object-oriented features to relational DBMS and allow the DBMS’s functionality to be extended to new application domains. For the important domain of fulltext retrieval and document management, we analyze whether current object-relational DBMS are already able to compete with specialized information retrieval (IR) systems. After discussing the main require...
متن کاملDocument Image Retrieval Based on Keyword Spotting Using Relevance Feedback
Keyword Spotting is a well-known method in document image retrieval. In this method, Search in document images is based on query word image. In this Paper, an approach for document image retrieval based on keyword spotting has been proposed. In proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method is used in this paper to imp...
متن کاملAn NF2 Relational Interface with Aggregation Capability for Document Retrieval, Restructuring and Analysis
Complex documents are used in many environments, e.g., information retrieval (IR). Such documents contain subdocuments, which may contain further subdocuments, etc. Powerful tools are needed to facilitate their retrieval, restructuring, and analysis. Existing IR systems are poor in complex document restructuring and data aggregation. However, in practice, IR system users would often want to obt...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Inf. Process. Manage.
دوره 24 شماره
صفحات -
تاریخ انتشار 1988